Simplify tests, shorten builddir name on windows #137
Conversation
I am not sure why CI is failing. When I run the tests locally they pass.
Force the tests to re-run (unsure how), and see if they still fail? If they do, force a re-run on the base (unsure how) and see if they still pass? If all that is true, then dig deeper.
The failure we get is genuine. I think you can get it too by running both …
When I run the tests locally the …
Maybe the CI runs pytest-xdist anyway? Or some other pytest version, or other plugins, etc.? Something that would make the order of execution different, e.g. …
I am stumped. No pytest-xdist plugin is used:
and the tests are run in the correct order:
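(The supporting output is not reproduced here.) For comparison, here is a hypothetical way to re-check both of those things locally; the exact invocation below is my sketch, not lifted from this project's CI:

```python
# Hypothetical local check, not taken from this PR's CI: print the
# collected test ids in collection order without executing anything,
# with pytest-xdist explicitly disabled ("-p no:xdist") in case it
# happens to be installed locally.
import pytest

pytest.main([
    "--collect-only", "-q", "-p", "no:xdist",
    "testing/cffi0/test_verify.py",
    "testing/cffi1/test_verify1.py",
])
```

The `plugins:` line in the session header of any normal pytest run also shows which plugins are active.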
When I can I will add debug cruft to …
Another strange thing I noticed in the codebase: there is both testing/cffi0/test_verify.py and testing/cffi1/test_verify1.py. These files echo each other: they have many of the same tests with slight variations. Some of these repeated tests are quite "expensive" on PyPy's nightly testing on windows. For instance: …
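The concrete example is elided above. As a general illustration (hypothetical code, not from this repository), such duplicated expensive tests could carry a shared marker that slow CI targets deselect:

```python
# Hypothetical sketch, not from this repository: mark tests that are
# duplicated between testing/cffi0/test_verify.py and
# testing/cffi1/test_verify1.py so slow windows CI targets can skip
# them with `-m "not slow"`.
import pytest


@pytest.mark.slow  # "slow" would need registering in pytest.ini to avoid a warning
def test_some_expensive_verify_case():
    # stand-in body; the real tests call ffi.verify() with large sources
    pass
```

A slow windows target could then run `pytest -m "not slow" testing/`, while at least one full target keeps running everything.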
Still stumped. I used …
Compare: f40b90e to 4e7ebbd
CI was passing so I removed the debug cruft and enabled windows tests. Let's see how long it takes. I am not sure why, when passing a module name, I had to make it unique, but 🤷. I guess #135 avoided this problem by making all the module names unique.
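For illustration, here is a minimal self-contained sketch of the unique-module-name idea; the naming scheme and the `cos` example are my own, not this PR's code:

```python
# Hypothetical sketch: give each verify() call a unique module name so a
# stale cached build from an earlier test run cannot be picked up by mistake.
import math
import uuid

import cffi

ffi = cffi.FFI()
ffi.cdef("double cos(double);")
modulename = "test_verify_" + uuid.uuid4().hex[:8]  # made-up naming scheme
lib = ffi.verify('#include <math.h>',
                 libraries=["m"],  # ["m"] on linux; typically empty on windows
                 modulename=modulename)
assert lib.cos(1.23) == math.cos(1.23)
```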
25 minutes for the windows tests, I guess that is acceptable? It could be less if the repetitive tests were removed from cffi0/test_verify.py and cffi1/test_verify1.py.
cffi0 checks the verify() call that was in cffi 0.x.y, while cffi1 checks the new compile() call. These are very different pieces of code, even if they both implement a large common part.
Ahh, makes sense, thanks for clarifying. CI is passing, including the newly enabled windows tests.
Rebased on main and cleaned up history. CC @ngoldbaum. I would consider this ready to merge if CI passes, the tests on windows do not take too long, and all the tests (except embedding on windows) run. We should maybe open an issue to revert disabling the embedding tests on windows (commit 8be5d8a), although if I understand correctly they were not run in CI before this commit anyway. I am not sure about the changes in the GitHub CI YAML: each platform has a different … and windows uses …
CI test comparison:
So windows is still ~8x slower than linux while running fewer tests 🤷
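As an aside, stock pytest can show where those minutes go; the `--durations` flag is standard pytest, not something added by this PR:

```python
# Standard pytest feature, shown here only as a suggestion: report the
# 25 slowest test phases so the duplicated expensive tests stand out.
import pytest

pytest.main(["--durations=25", "testing/"])
```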
assert not os.path.exists(cfile)  # no generated C source exists yet
lib = ffi.verify('#include <math.h>', libraries=lib_m, modulename=modulename)
assert lib.cos(1.23) == math.cos(1.23)  # the loaded module works
assert not os.path.exists(cfile)  # verify() reused the cached module, no C file was regenerated
I wonder if setting … I personally find the linux version the most readable. The pip upgrade should happen elsewhere, probably on Mac. It looks like …
I wonder what the original intent of … is.
Just to allow individual Windows matrix entries to override the test command. I'll sometimes enable a larger subset of tests for a small number of targets to verify things that are too slow to do for all Windows targets, especially since we often can't run all targets in parallel. I'll begrudgingly deal with a couple of 20-40 min test runs, but I don't want to sit around waiting for 20 of them running serially 😆. Running full tests on emulated arches is currently the largest contributor to the overall build time, which is usually why I'll end up turning those back down to just some minimal smoke tests against the built artifact. If it was just those, I wouldn't complain, but they tend to clog up the works because they'll eventually all run at the same time and block shorter targets from running. We could probably get fancy with concurrency groups to try and minimize the total runtime, but that's been a pretty low priority; I usually settle on something like at least one full test run for each platform/arch for release builds. Doing full tests on every Python/platform/arch would be nice, but there's definitely diminishing returns, especially with emulation or slow targets.
Ahh, cool, so let’s leave that part alone.
Taken from #135, without the pytest-xdist module name randomization
This should be smaller and easier to review.